AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Efficient attention mechanism

# Efficient attention mechanism

Tweety 7b Dutch V24a
Apache-2.0
Tweety-7b-dutch is a foundational large language model specialized in Dutch, based on the Mistral architecture, optimized for Dutch text processing with a Dutch tokenizer.
Large Language Model Transformers Other
T
Tweeties
1,568
13
Mistral 7B Instruct V0.2 Sparsity 20 V0.1
Apache-2.0
Mistral-7B-Instruct-v0.2 is an instruction-finetuned large language model improved from Mistral-7B-Instruct-v0.1, compressed to 2% sparsity using Wanda pruning method while maintaining competitive performance without retraining.
Large Language Model Transformers
M
wang7776
80
1
Mpt 7b 8k Instruct
Apache-2.0
MPT-7B-Instruct-8k is a model for long-format instruction following, especially good at answering questions and summarizing long documents.
Large Language Model Transformers Other
M
mosaicml
2,513
27
Chinese Bigbird Base 4096
Apache-2.0
Chinese pre-trained model based on BigBird architecture, supporting 4096-length context processing
Large Language Model Transformers Chinese
C
Lowin
48
3
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase